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=> fil reg; d que 12 

FILE ' REGISTRY 1 ENTERED AT 15:52:41 ON 14 APR 2004 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS. 

COPYRIGHT (C) 2004 American Chemical Society (ACS) 

Property values tagged with IC are from the ZIC/VINITI data file 
provided by InfoChem. 

STRUCTURE FILE UPDATES: 13 APR 2004 HIGHEST RN 675103-21-6 
DICTIONARY FILE UPDATES: 13 APR 2004 HIGHEST RN 675103-21-6 

TSCA INFORMATION NOW CURRENT THROUGH JANUARY 6, 2004 

Please note that search-term pricing does apply when 

conducting SmartSELECT searches. ' 

Crossover limits have been increased. See HELP CROSSOVER for details. 

Experimental and calculated property data are now available. For more 
information enter HELP PROP at an arrow prompt in the file or refer 
to the file summary sheet on the web at: 
http : //www. cas . org/ONLINE/DBSS/registryss . html 

L2 30 SEA FILE=REGISTRY ABB-ON EfQSHYE] [SQITNP] [FTSPLI] [NSKMTP] [DKTE 

] [ FLRI ] [TSN]R[IVA] /SQSP 

=> d rn en sql kwic nte 1c. 1-30 12 

L2 ANSWER 1 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN , ^ xIn 

RN 654766-87-7 REGISTRY —U^c ft^is"^ # ±o ^fzh ^^U^^ ™ e$TX 

CN Protein (Rhodopseudomonas palustris clone US20030233675-SEQID-2064 1 ) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN 641: PN: US20030233675 SEQID: 20641 claimed protein 
SQL 345 ( ^ 

SQL' /Uc£.u.yvcu~ tentpn 

SEQ <51 RVIENQGLFW KLALNGIILN TRPARKAKDY QKIWNHEKNE SPLKTITRAQ 

HITS AT: 90-99 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS 

L2 ANSWER 2 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 626155-11-1 REGISTRY 

CN Ferrochelatase (Rhodopseudomonas palustris CGA009 strain CGA009 gene hemZ) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAE26319 

CN GenBank CAE26319 (TRANSLATED FROM: GenBank BX572595) 
SQL 345 

SEQ 51 RVIENQGLFW KLALNGIILN TRPARKAKDY QKIWNHEKNE SPLKTITRAQ 

HITS AT: 90-99 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS 

L2 ANSWER 3 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
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RN 626113-69-7 REGISTRY 

CN Protein (Bdellovibrio bacteriovorus strain HD100 292-amino acid) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 
CN GenBank CAE7 8 991 

CN GenBank CAE78991 (TRANSLATED FROM: GenBank BX842-648) 
SQL 292 

SEQ 201 VNGVLAYALE KTDYERNNIA VPPLKFLKVK NMPNHISVWA SCEKSLRGEQ 



251 ILKDLTRAVR EKNVKGKIYK YFMQELPPDM KKGYRALYDT PP 

HITS AT: 249-258 

LC STN Files: CA, CAPLUS 

L2 ANSWER 4 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 623719-21-1 REGISTRY 

CN GenBank CAD54 412 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank CAD54412 (TRANSLATED FROM: GenBank AJ511870) 
SQL 1726 

SEQ 101 LCNYVFSAHC GDAYEDFNIQ LRRVQESNTT TLSRVTMKLD GLVVELTKSS 



HITS AT: 126-135 

LC STN Files: CAPLUS 

L2 ANSWER 5 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 602352-27-2 REGISTRY 

CN Protein (Methanopyrus kandleri strain AV19 gene MK1614) (9CI) (CA INDEX 

NAME) 
OTHER NAMES: 

CN 1613: PN: WO03076575 SEQID: 1614 claimed sequence 
SQL 157 

SEQ . 1 MAEKKNEQEI QQELQRLIAE INRLQGQMEA INAQIDLIES SISELNRVEE 



HITS AT: 39-48 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS 

L2 ANSWER 6 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 5 73581-3 9-2 REGISTRY 

CN ABC transporter, periplasmic sugar-binding protein (Chromobacterium 

violaceum strain ATCC 12472 gene CV0262) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAQ57941 

CN GenBank AAQ57941 (Translated from: GenBank AE016910) 
SQL 415 

SEQ 251 LAMLESTPTT LTRVRDEAPK VYRDTRVAAA PLGPTGIAAG GWMFNFAVAK 
HITS AT: 255-264 

LC STN Files: CA, CAPLUS, TOXCENTER 

L2 ANSWER 7 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 5704 64-4 4-7 REGISTRY 

CN Protein (Proteus mirabilis strain ATCC202157 clone US6605709-SEQID-7 333 

open reading frame) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 3161: PN: US6605709 SEQID: 7333 claimed protein 
SQL 254 
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SEQ 51 ITSTRTALGK GLFEISVKKE NIGIAIQILE EYQLPTISRI EITQLFPSDA 

HITS AT: 81-90 

LC STN Files: CA, CAPLUS 

L2 ANSWER 8 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 549605-92-7 REGISTRY 

CN 2217: PN : WO03008540 SEQID: 8061 unclaimed protein (9CI) (CA INDEX NAME) 
SQL 4 30 

SEQ 351 QQQKVAFLFC CGCSMCEETL TDRNRVKKAQ QYHLPTPNRI SGLETSHRRT 
HITS AT: 367-37 6 

LC STN Files: CA, CAPLUS, TOXCENTER 

L2 ANSWER 9 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 524076-00-4 REGISTRY 

CN 10: PN: US6566583 SEQID: 10 unclaimed protein (9CI) (CA INDEX NAME) 
SQL 543 

SEQ 1 MNPTATNEML SPWPWAVTES NISFDVQVME QQLKDFSRAC YVVNHADHGF 

HITS AT: 30-39 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS, USPATFULL 

L2 ANSWER 10 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 510915-30-7 REGISTRY 

CN Protein (Propionibacterium acnes strain ATCC6919 clone 

WO03033515-SEQID-27954 open reading frame) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 3954: PN: WO03033515 SEQID: 27954 claimed protein 
SQL 113 

SEQ 51 APNSTAFCRS PVSIRPATRP AAKESPPPTR SRISRLGRST ERTKPVSFDH 

HITS AT: 7 4-83 

* * RELATED SEQUENCES AVAILABLE WITH SEQLINK* * 
LC STN Files: CA, CAPLUS, TOXCENTER 

L2 ANSWER 11 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 510901-64-1 / REGISTRY 

CN Protein (Propionibacterium acnes strain ATCC6919 clone 

WO03033515-SEQID-26541 open reading frame) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2541: PN: WO03033515 SEQID: 26541 claimed protein 
SQL 7 9 

SEQ 1 TIEVSPGVVM ARAPWAAPNS TAFCRSPVSI RPATRPAAKE SPPPTRSRIS 



HITS AT : . 4 0-4.9 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS, TOXCENTER 

L2 ANSWER 12 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 489862-35-3 REGISTRY 

CN GenBank CAA584 38 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank CAA58438 (Translated from: GenBank X83413) 
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SQL 870 

SEQ 101 SKGLNKGIFE NMFTNKEKFE SQFSDINRAL LRLGNFIKWG SNVAIDTPYV 
HITS AT: 120-129 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

L2 ANSWER 13 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 485104-00-5 REGISTRY 

CN GenBank AAA4 6012 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank AAA46012 (Translated from: GenBank M87287) 
SQL 870 

SEQ 101 SKGLNKGIFE NMFTNKEKFE SQFSDINRAL LRLGNFIKWG SNVAIDTPYV 
HITS AT: 120-129 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

L2 ANSWER 14 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 

RN 483502-77-8 REGISTRY 

CN GenBank BAB17787 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank BAB17787 (Translated from: GenBank AB042530) 

SQL 1851 

SEQ 101 YFPGLCNYVF SEHCGAAYED FNIQLRRGLE SNSTTLSRVI MKLDGLVVEL 



HITS AT: ir 130-139 
NTE 



type 


location 




description 


uncommon 


Aaa-1846 







L2 ANSWER 15 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 

RN 459531-51-2 REGISTRY 

CN GenBank AAB81126 (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank AAB81126 (Translated from: GenBank U73935) 

SQL 543 

SEQ 1 MNPTATNEML SPWPWAVTES NISFDVQVME QQLKDFSRAC YVVNHADHGF 



HITS AT: 30-39 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 

L2 ANSWER 16 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 432279-88-4 REGISTRY 

CN Protein ( Propionibacterium acnes strain ATCC6919 clone 

WO0181581-SEQID-27954 open reading frame) (9CI) (CA INDEX NAME) 
OTHER NAMES : 

CN 2954: PN: WO0181581 SEQID: 27954 claimed protein 
SQL 113 

SEQ 51 APNSTAFCRS PVSIRPATRP AAKESPPPTR SRISRLGRST ERTKPVSFDH 



HITS AT: 74-83 
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** RELATED SEQUENCES AVAILABLE WITH SEQLTNK** 
LC STN Files: CA, CAPLUS, TOXCENTER 

L2 ANSWER 17 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 432266-21-2 REGISTRY . 

CN Protein ( Propionibacterium acnes strain ATCC6919 clone 

WO0181581-SEQID-26541 open reading frame) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 1541: PN: WO0181581 SEQID: 26541 claimed protein 
SQL 7 9 

SEQ 1 TIEVSPGVVM ARAPWAAPNS TAFCRSPVSI RPATRPAAKE SPPPTRSRIS 



HITS AT: 40-49 ' ' ^ ^ 

**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS, TOXCENTER 

L2 ANSWER 18 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 406671-25-8 REGISTRY 

CN Prefoldin, molecular chaperone implicated in de novo protein folding 
(Methanopyrus kandleri strain AV19 gene GIM5) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAM02827 

CN GenBank AAM02827 (Translated from: GenBank AE010451) 
SQL 157 

SEQ 1 MAEKKNEQEI QQELQRLIAE INRLQGQMEA INAQIDLIES SISELNRVEE 



HITS AT: 39-48 - 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS 

L2 ANSWER 19 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 353881-58-0 REGISTRY 

CN Galactose-l-phosphate uridyltransf erase (Clostridium acetobutylicum strain 

ATCC 824 gene CAC2961) (9CI) (CA INDEX NAME) 
OTHER NAMES: 
CN GenBank AAK80903 

CN GenBank AAK80903 (Translated from: GenBank AE007793) 
SQL 4 97 

SEQ 201 LNKSKWFLQY SPYTYYNEHC I ILNNEHIPM KISRITFENL LSFIDILPHY 

HITS AT: 226-235 

LC STN Files: CA, CAPLUS 

L2 ANSWER 20 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 348670-68-8 REGISTRY 

CN Protein (Sulfolobus solfataricus gene SS02259) (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN GenBank AAK42425 

CN GenBank AAK42425 (Translated from: GenBank AE006830) 
SQL 220 

SEQ 101 KGILDPIIGL LEDEESLGKI INALINDFTL NLINHWEEII NDLSRIDLTN 



HITS 

LC 



AT: 137-146; 
STN Files: C 



CA, CAPLUS 



L2 
RN 



ANSWER 21 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
342056-52-4 REGISTRY 
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CN Protein (Shewanella putrefaciens 542-amino acid) (9CI) (CA INDEX NAME ) 
OTHER NAMES: 

CN 4: PN: JP2001145490 SEQID: 13 claimed protein 
SQL 542 

SEQ 1 MNPTATNEML SPWPWAVTES NISFDVQVME QQLKDFSRAC YVVNHADHGF 



HITS AT: ,- 30-39 ' 

LC STN Files: CA, CAPLUS 

L2 ANSWER 22 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 325762-37-6 REGISTRY 

CN Blood-coagulation factor X, prepro- [227-serine, 228-glutamine, 22 9- 

threonine, 230-serine, 231-lysine, 232-leucine, 233-threonine] (human liver) 
(9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 8: PN: WO0110896 SEQID: 2 claimed protein 
SQL 488 

SEQ 201 KPYDAADLDP TENPFDLLDF NQTQPESQTS KLTRIVGGQE CKDGECPWQA 



HITS AT: 226-235 

LC STN Files: CA, CAPLUS, TOXCENTER 

L2 ANSWER 23 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 325762-36-5 REGISTRY 

CN Blood-coagulation factor X, prepro- [227-glutamine, 228-serine, 229- 

phenylalanine, 230-asparagine, 231-aspartic acid, 232-phenylalanine , 233- 
threonine] (human liver) (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 7: PN: WO0110896 SEQID: 2 claimed protein 
SQL 488 

SEQ 201 KPYDAADLDP TENPFDLLDF NQTQPEQSFN DFTRIVGGQE CKDGECPWQA 
HITS AT: 226-235 

LC STN Files: CA, CAPLUS, TOXCENTER 

L2 ANSWER 24 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 303239-87-4 REGISTRY 

CN Protein (Arabidopsis thaliana clone Ceres_2173674 ) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 1354: PN: EP1033405 SEQID: 66354 claimed protein 
SQL 151 

SEQ 1 MPCSSDHEAW MKCYKENIGS PLKCSGFVKS FQDCARRSRQ QVNPEENSNT 



51 LNRVNLGEQI FLSIFNVMTR MMLGAIVEEE ERTILGNELK KLILLFQISK 

HITS AT: 4 5-54 

LC STN Files: CA, CAPLUS 

L2 ANSWER 25 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 303239-86-3 REGISTRY 

CN Protein (Arabidopsis thaliana clone Ceres__2173673 ) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 1353: PN : EP1033405 SEQID: 66353 claimed protein 
SQL 217 

SEQ 101 ARRSRQQVNP EENSNTLNRV NLGEQIFLSI FNVMTRMMLG AIVEEEERTI 



HITS AT: 111-120 £ 

LC STN Files: CA, CAPLUS 
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L2 ANSWER 26 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 

RN 303239-85-2 REGISTRY 

CN Protein (Arabidopsis thaliana clone Ceres_2173672) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 1352: PN: EP1033405 SEQID: 66352 claimed protein 

SQL 274 

SEQ 151 VKSFQDCARR SRQQVNPEEN SNTLNRVNLG EQIFLSIFNV MTRMMLGAIV 



HITS AT: 168-177 " " ^ " * ' 

LC STN Files: CA, CAPLUS 

L2 ANSWER 27 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 167975-3 8-4 . REG I S T R Y 

CN Phosphoprotein (human herpesvirus 6 strain U1102 gene Ull) (9CI) (CA 

INDEX NAME) 
OTHER CA INDEX NAMES: 

CN Phosphoprotein (human herpes virus 6 strain U1102 gene Ull) 
SQL 870 

SEQ 101 SKGLNKGIFE NMFTNKEKFE SQFSDINRAL LRLGNFIKWG SNVAIDTPYV 
HITS AT : | 120- 129" 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS 

L2 ANSWER 28 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 153926-89-7 REGISTRY 

CN Protein (Shewanella putrefaciens clone pEPA 543-amino acid reduced) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN Protein (Shewanella putrefaciens strain SCRC-2874 open reading frame ORF9) 
CN Protein (Shewanella tumefaciens clone pEPA open reading frame ORF9) 
SQL 543 

SEQ 1 MNPTATNEML SPWPWAVTES NISFDVQVME QQLKDFSRAC YVVNHADHGF 

HITS AT: - 30-39 

* ^RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
LC STN Files: CA, CAPLUS, US PAT FULL 

L2 ANSWER 2 9 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 153676-37-0 REGISTRY 

CN Protein (human herpesvirus 6 strain U1102 gene P1LF1) (9CI) (CA INDEX 
NAME) 

OTHER CA INDEX NAMES: 

CN Protein (human herpes virus 6 strain U1102 gene P1LF1) 

OTHER NAMES: 

CN GenBank AAA16716 

CN GenBank AAA16716 (Translated from: GenBank L25528) 
SQL 871 

SEQ 101 NSKGLNKGIF ENMFTNKEKF ESQFSDINRA LLRLGNFIKW GSNVAIDTPY 

HITS AT: 121-130 : 

LC STN Files: CA, CAPLUS 

L2 ANSWER 30 OF 30 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 14.7156-14-7 REGISTRY 

CN Protein p 100 (human herpesvirus 6 clone pDF44 6-4 /pDF4 4 6-3/pD2Hae/pMF101R) 
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(9CI) (CA INDEX NAME) 
OTHER CA INDEX NAMES: 

CN Protein p 100 (human herpes virus 6 clone pDF4 4 6-4 /pDF4 4 6- 

3/pD2Hae/pMF101R) 
OTHER NAMES : 

CN P100 capsid protein (human herpesvirus 6) 
SQL 869 

SEQ 101 SKGLNKGIFE NMFTNKEKFE SQFSDINRAL LRLGNFIKWG SNVAIDTPYV 



HITS AT: 120-129 

LC STN Files: CA, CAPLUS, MEDLINE, TOXCENTER, USPATFULL 



=> fil capl uspatf medl toxcenter; s 12 

FILE 1 CAPLUS ' ENTERED AT 15:54:21 ON 14 APR 2004 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS. 

COPYRIGHT (C) 2004 AMERICAN CHEMICAL SOCIETY (ACS) 

FILE ' USPATFULL ' ENTERED AT 15:54:21 ON 14 APR 2004 

CA INDEXING COPYRIGHT (C) 2004 AMERICAN CHEMICAL SOCIETY (ACS) 

FILE 'MEDLINE 1 ENTERED AT 15:54:21 ON 14 APR 2004 

FILE 'TOXCENTER 1 ENTERED AT 15:54:21 ON 14 APR 2004 
COPYRIGHT (C) 200 4 ACS 



L4 



44 L2 



=> dup rem 14 

PROCESSING COMPLETED FOR L4 

L5 :>V ¥\ vW\ 33 DUP REM L4 (11 DUPLICATES REMOVED) 
ANSWERS '1-28' FROM FILE CAPLUS 
ANSWERS '29-31' FROM FILE USPATFULL 
ANSWERS '32-33' FROM FILE MEDLINE 

•=> d ibib ed ab hitrn 1-31; d iall 32-33 



L5 ANSWER 1 OF 33 CAPLUS COPYRIGHT 2004 ACS on STN DUPLICATE 1 



ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 



INVENTOR (S) 



PATENT ASSIGNEE (S) : 
SOURCE : 

DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



2003:326611 CAPLUS 
138:298927 

Propionibacterium acnes genes and encoded protein 
sequences and their use in therapy and diagnosis of 
acne vulgaris 

Mitcham, Jennifer L.; Skeiky, Yasir A. W. ; Persing, 

David H.; Bhatia, Ajay; Maisonneuve, Jean-Francois L.; 

Zhang, Yanni; Wang, Siqing; Jen, Shyian; Lodes, 

Michael J.; Benson, Darin R. ; Jones, Robert; Carter, 

Darrick; Barth, Brenda; Vallieve-Douglass , John 

Corixa Corporation, USA 

PCT Int. Appl., 1481 pp. 

CODEN: PIXXD2 

Patent 

English 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



WO 2003033515 Al 20030424 WO 2002-XF32727 20021011 

W: AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, 
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ED 
AB 



IT 



Entered STN: 30 Apr 2003 

Compns. and methods for the therapy and diagnosis of acne vulgaris and 
other related conditions are disclosed. The invention provides 299 contig 
sequences assembled- from 56,716 fragments of the Propionibacterium acnes 
(strain ATCC 6919 or NCTC737) genome. Translation of all 299 contigs in 
all 6 possible reading frames identified 28,913 open reading frames at 
least 50 amino acids in length. BLAST and GenMark bioinf ormatic analyses 
were used to identify protein-coding regions from the DNA sequence. 
Re-anal, using Phrap reduced the no. of contigs to 105, and identified 
26,4 62 ORFs including an addnl . 1692 new putative ORFs . Several P. acnes 
antigens were identified by serol. expression cloning to be correlated 
with patients with histories of severe acnes. Given the importance of 
proteins as immunotherapeutic/vaccine targets, BLASTp and TBLASTN searches 
identified specific ORFs as members of the classes of transferase, 
enterotoxin, lipoprotein, membrane, permease, enterobactin, 

protease/proteinase, protein A, secreted, dismutase, adhesin, transporter, 
hemolysin, penicillin-binding protein, sialidase, siderophore, and lipase 
proteins. Therapeutic compns. may comprise one or more Propionibacterium 
acnes proteins, immunogenic portions thereof, or polynucleotides that 
encode such portions. Alternatively, a therapeutic compn. may comprise an 
antibody that binds a Propionibacterium acnes protein, antigen-presenting 
cell that expresses a Propionibacterium acnes protein, or a T cell that is 
specific for cells expressing such a protein. Such compns. may be used, 
for example, for the prevention and/or treatment of acne. An animal model 
for P. acnes-induced inflammatory acne is also provided for identification 
of immunogenic proteins by serol. expression cloning and proteomic anal. 
[This abstr. record is one of eight records for this document necessitated 
by the large no. of index entries required to fully index the document and 
publication system constraints.]. . i i j. / jl 

RL: BSU (Biological study, unclassified); DGN (Diagnostic use); PRP^ 
(Properties); THU (Therapeutic use); BIOL (Biological study); USES (Uses) 
(amino acid sequence; Propionibacterium acnes genes and encoded protein 
sequences and their use in therapy and diagnosis of acne vulgaris) 
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WO 2003008540 A2 20030130 WO 2002-XA19668 20020621 

W: AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, BZ,-CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
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RW: GH, GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW, AT, BE, CH, 
CY, DE, DK, ES, FI , FR, GB, GR, IE, IT, LU, MC, NL, PT, SE, TR, 
BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, . TD, TG 

WO 2003008540 A2 20030130 WO 2002-US19668 20020621 

WO 2003008540 A3 20031204 

W: AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, NO, NZ, OM, PH, 
PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TN, TR, TT, TZ, 
UA, UG, US, UZ, VN, YU, ZA, ZM, ZW, AM, AZ, BY, KG, KZ, MD, RU, 
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US 2004010815 Al 20040115 . US 2002-259194 20020926 

US 2004016025 Al 20040122 US 2002-260238 20020926 

PRIORITY APPLN. INFO. : US 2001-300112P P 20010622 
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US 2002-370620P- P 20020404 
US 2002-370743P P 20020404 

ED Entered^STN: 15 Jul 2003 

AB Abiotic stress responsive polynucleotides and polypeptides are disclosed. 
Also disclosed are vectors, expression cassettes, host cells, and plants 
contg. such polynucleotides. Also provided are methods for using such 
polynucleotides and polypeptides, for example, to alter the responsiveness 
of a plant to abiotic stress. Rice (Oryza sativa japonica) cDNA library 
was constructed and sequenced, and used in GeneChip std. protocol for 
expression profiling of stress-regulated genes. Based on the profiles, 
clusters of nucleic sequences that were altered at least two-fold in 
response to the stress condition were identified. Identification of 
abiotic stress responsive genes using yeast two hybrid system was also 
demonstrated. Rice orthologs of Arabidopsis abiotic stress genes were 
identified by reverse genetics. Transgenic rice expressing "abiotic 
stress candidate gene" was produced. The present invention claimed 
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abiotic stress responsive cDNAs (SEQ IDs 1-4131, 8263-8353, 8445-8829 and 
17505-17506) and proteins (SEQ IDs 4132-8262, 8354-8444, and 8830-9214), 
but the Sequence Listing was not made available on publication of the 
patent application. 
IT 549605-92-7 

RL: PRP (Properties) 

(unclaimed protein sequence; abiotic stress responsive polynucleotides 
and polypeptides from plants and methods of altering the stress 
responsiveness of a plant) 
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ED Entered STN: 22 May 2003 

AB The present invention provides protein and cDNA sequences of a novel 

Schizochytrium aggregatum polyketide-like synthesis (PKS-like) gene. The 
present invention relates to compns . and methods for prepg. poly-unsatd. 
long chain fatty acids in plants, plant parts and plant cells, such as 
leaves, roots, fruits and seeds. Nucleic acid sequences and constructs 
encoding PKS-like genes required for the poly-unsatd. long chain fatty 
acid prodn., including the genes responsible for eicosapentenoic acid 
prodn. of Shewanella putrefaciens and novel genes assocd. with the prodn. 
of docosahexaenoic acid in Vibrio marinus are used to generate transgenic 
plants, plant parts and cells which contain and express one or more 
transgenes encoding one or more of the PKS-like genes assocd. with such 
long chain polyunsatd. fatty acid prodn. Expression of the PKS-like genes 
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in the plant system 
chain fatty acids s 
for modification of 
tissues. Manipulat 
com. quantities of 
IT 524076-00-4 - 

RL: PRP (Properties 
(unclaimed prote 
Schizochytrium a 
use) 

REFERENCE COUNT: 



permits the large scale prodn. of poly-unsatd. long 
uch as eicosapentenoic acid and docosahexaenoic acid 

the fatty acid profile of plants, plant parts and 
ion of the fatty acid profiles allows for the prodn. of 
novel plant oils and products. 

) 

in sequence; protein and cDNA sequences of a 
ggregatum polyketide-like synthase (PKS-like) gene and 

26 THERE ARE 26 CITED REFERENCES AVAILABLE FOR THIS 
RECORD. ALL CITATIONS AVAILABLE IN THE RE FORMAT 
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Fabricia; Nicolas, Marisa Fabiana; Oliveira, 
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Fabio de Oliveira; Pena, Sergio Danilo Junho; Pereira, 
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Rodrigues Costa; Pinto, Luciano da Silva; Porto, Jorge 
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Cicero Eduardo; Reis, Alessandra Maria Moreira; Rigo, 
Liu Urn; Rondinelli, Edson; Pedraca do Santos, Elen 
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Proceedings of the National Academy of Sciences of the 
United States of America (2003), 100(20), 11660-11665 
CODEN: PNASA6; ISSN: 0027-8424 
National Academy of Sciences 
Journal 
English 
Entered STN: 17 Oct 2003 
Chromobacterium violaceum is one of millions of species of free-living 
microorganisms that populate the soil and water in the extant areas of 
tropical biodiversity around the world. The complete genome of C. 
violaceum consists of a single circular chromosome of 4,751,080 bp with an 
av. G+C content of 64.83% and 4431 uniformly distributed protein coding 
ORFs that cover 8 9% of the genome. Its complete genome sequence reveals 
(i) extensive alternative pathways for energy generation, (ii) . apprx.500 
ORFs for transport-related proteins, (iii) complex and extensive systems 
for stress adaptation and motility, and (iv) widespread utilization of 
quorum sensing for control of inducible systems, all of which underpin the 
versatility and adaptability of the organism. The genome also contains 
extensive but incomplete arrays of ORFs coding for proteins assocd. with 
mammalian pathogenicity, possibly involved in the occasional but often 
fatal cases of human C. violaceum infection.' There is, in addn., a series 
of previously unknown but important enzymes and secondary metabolites 
including paraquat-inducible proteins, drug and heavy-metal-resistance 
proteins, ■ multiple chitinases, and proteins for the detoxification of 
xenobiotics that may have biotechnol. applications. The genome sequence 
is deposited in GenBank/EMBL/DDB J under accession no. AE016825 and in the 
NCBI RefSeq Genome Database under accession no. NC_005085. 
573581-39-2 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 

(Biological study) 

(amino acid sequence; complete genome sequence of Chromobacterium 
violaceum reveals remarkable and exploitable bacterial adaptability) 
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WO 2001081581 A2 20011101 WO 2001-US12865 20010420 

WO 2001081581 A3 20020314 

WO 2001081581 C2 20030103 

W: AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, 
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LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, NO, NZ, PL, PT, RO, 
RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, US, UZ, 
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PRIORITY APPLN. INFO. : US 2000-199047P P 20000421 

US 2000-208841P P 20000602 
US 2000-216747P P 20000707 
WO 2001-US12865 W 20010420 

ED Entered STN: 06 Feb 2002 

AB Compns . and methods for the therapy and diagnosis of acne vulgaris and 

other related conditions are disclosed. Compns. may comprise one or more 
Propionibacterium acnes proteins, immunogenic portions thereof, or 
polynucleotides that encode such portions. Thus, overlapping clones 
representing .apprx.8.6 full-length genome equiv. from a P. acnes genomic 
library were aligned to form 299 linear contigs. These 299 contigs 
represent a total assembled length of about 2,656,860 nucleotides covering 
>90% of the P. acnes genome. Six-frame translation is performed in order 
to predict 28,913 open reading frames encoding P. acnes polypeptide 
sequences .gtoreq.50 amino acids in length. A therapeutic compn. may also 
comprise an antibody that binds a P. acnes protein, antigen-presenting 
cells that express a P. acnes protein, or a T cell that is specific for 
cells expressing such a protein. Such compns. may be used, for example, 
for the prevention and/or treatment of acne. 

IT 432266-21-2 432279-88-4 

RL: ANT (Analyte); BSU (Biological study, unclassified); DGN (Diagnostic 
use); PRP (Properties); THU (Therapeutic use); ANST (Analytical study); 
BIOL (Biological study); USES (Uses) 

(amino acid sequence; Propionibacterium acnes nucleic acids and 
proteins useful for therapy and diagnosis of acne vulgaris) 
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MAR PAT 134:168320 
Entered STN: 15 Feb 2001 

The invention relates to factor Xa analogs with a modified protease 
cleavage site, comprising a substitution of a min. of one of the amino 
acid between Glu226 and Arg234 and possibly Ile235 in the region of 
activation peptide. These modified cleavage sites in the region of 
activation peptide change protease specificity and facilitate factor XIa 
cleavage of the precursor. The invention also relates to prepris . contg. 
said factor Xa analogs and methods for the prodn. thereof. The 
prepro-f actor X analogs may be used to produce high-purity factor X for 
use as coagulants. 
325762-36-5P 325762-37- 6P 
RL: BPN (Biosynthetic preparation); BPR (Biological process); BSD 
(Biological study, unclassified); PUR (Purification or recovery); THU 
. (Therapeutic use); BIOL (Biological study); PREP (Preparation); PROC 
(Process); USES (Uses) 

(amino acid sequence; factor X substitution mutant with improved 
ability to be activated) 
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1998 :572263 CAPLUS 
129:212504 

Genes coding for eicosapentaenoic acid synthetic 
enzymes and use for production of eicosapentaenoic 
acid in Escherichia coli 

Yazawa, Kazunaga; Yamada, Akiko; Kato, Seishi; Kondo, 
Kiyosi 

Sagami Chemical Research Center, Japan 

U.S., 81 pp., Cont .-in-part of U.S. 5,683,898. 

CODEN: USXXAM 

Patent 

English 

3 



PATENT NO. 



US 5798259 
US 5683898 



KIND DATE 



A 
A 



19980825 
19971104 



APPLICATION NO. DATE 



US 1996-752929 
US 1995-375709 



19961120 
19950120 
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PRIORITY APPLN. INFO. : JP 1992-147945 19920515 

US 1994-178251 19940110 
US 1995-375709 19950120 

ED Entered STN: 08 Sep 1998 

AB Claimed are DNA sequences .encoding eicosapentaenoic acid (EPA) 

biosynthetic enzymes derived from Shewanella putref aciens, and their 
cloning in Escherichia coli for the prodn. of EPA. There is provided an 
advantageous process for prodn. of EPA by a recombinant technique wherein 
genes coding for EPA biosynthesis enzymes useful as pharmaceuticals, 
agrochems., foods, feeds or the like are obtained from microorganisms. 
EPA is produced by obtaining genes coding for eicosapentaenoic acid (EPA) 
biosynthesis enzymes, constructing a plasmid by joining the genes to a 
vector, transforming E. coli with the plasmid, and culturing the 
transformed E. coli. 

IT 153926-89-7 

KL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; genes coding for eicosapentaenoic acid synthetic 
enzymes and use for prodn. of eicosapentaenoic acid in E. coli) 
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1997 : 719630 CAPLUS 
128: 10898 

Sequence of Shewanella putrefaciens gene coding for 9 
eicosapentaenoic acid synthesizing enzymes and process 
for production of eicosapentaenoic acid with 
expression in Escherichia coli using pEPA vector 
Yazawa, Kazunaga; Yamada, Akiko; Kato, Seishi; Kondo, 
Kiyosi 

Sagami Chemical Research Center, Japan 
U.S., 77 pp., Cont ,-in-part of U.S. Ser. 
abandoned . 
CODEN: USXXAM 
Patent 
English 
3 



No. 178,251, 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



US 5683898 
US 5798259 
PRIORITY APPLN. 



of EPA by a gene 



A 19971104 US 1995-375709 19950120 

A 19980825 US 1996-752929 19961120 

INFO.: JP 1992-147945 19920515 

US 1994-178251 19940110 
US 1995-375709 19950120 

ED Entered STN: 14 Nov 1997 

AB There is provided an advantageous process for prodn. 

recombinant technique wherein genes coding for biosynthesis enzymes for 
eicosapentaenoic acid (EPA) useful as pharmaceuticals, agrochems., foods, 
feeds or the like is obtained from microorganisms. EPA is produced by 
obtaining genes coding for eicosapentaenoic acid (EPA) biosynthesis 
enzymes, constructing a plasmid by joining the genes to a vector, 
transforming E. coli with the plasmid, and culturing the transformed E. 
coli . 

IT 15392 6-8 9-7P 

RL: BPN (Biosynthetic preparation); BSU (Biological study, unclassified); 

PRP (Properties); BIOL (Biological study); PREP (Preparation) 
(amino acid sequence; Shewanella putrefaciens genes encoding 
eicosapentaenoic acid synthesizing enzymes and prodn. of 
eicosapentaenoic acid with Escherichia coli) 
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CAPLUS COPYRIGHT 2004 ACS on STN DUPLICATE 10 
1993:210909 CAPLUS 
118:210909 

Gene for the major antigenic structural protein (plOO) 
of human herpesvirus 6 

Neipel, Frank; Ellinger, Klaus; Fleckenstein, Bernhard 
Inst. Klin. Mol. Virol., Univ. Erlangen-Nuernberg, 
Erlangen, D-8520, Germany 

Journal of Virology (1992), 66(6), 3918-24 
CODEN: JOVIAM; ISSN: 0022-538X 
Journal 
English 
Entered STN: 29 May 1993 

A human herpesvirus 6 (HHV-6) structural protein of 100 kDa (plOO) is the 
polypeptide most frequently and intensively reactive in immunoblott ing 
analyses with human sera on HHV-6-inf ected cells or partially purified 
virions. The gene for plOO was identified by screening a bacteriophage 
lambda library with monospecific rabbit antisera. The gene codes for a 
polypeptide of 870 amino acids with a calcd. mol. size of 97 kDa. Its 
N-terminal third is weakly homologous to the immunogenic basic matrix 
phosphoprotein ppl50 of human cytomegalovirus. Five fragments 
representing more than 93% of HHV-6 plOO were prokaryotically expressed. 
The antigenic epitopes of plOO were preliminarily mapped by immunoblott ing 
with human sera. They are located within the C-terminal part which is 
neither homologous nor cross-reactive to ppl50 of human cytomegalovirus. 
Availability of the gene for the immunodominant structural protein should 
provide tools for studies of pathogenesis by HHV-6. 
147156-14-7 
RL: PRP (Properties) 

(amino acid sequence of, complete) 
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CAPLUS COPYRIGHT 2004 ACS on STN 
2004:68662 CAPLUS 
140:158345 

A predator unmasked: Life cycle of Bdellovibrio 
bacteriovorus from a genomic perspective 
Rendulic, Snjezana; Jagtap, Pratik; Rosinus, Andrea; 
Eppinger, Mark;- Baar, Claudia; Lanz, Christa; Keller, 
Heike; Lambert, Carey; Evans, Katy J. ; Goesmann, 
Alexander; Meyer, Folker; Sockett, R. Elizabeth; 
Schuster, Stephan C. 

Max-Planck-Institute for Developmental Biology, 
Tuebingen, 72076, Germany 

Science (Washington, DC, United States) (2004), 



PUBLISHER: 
DOCUMENT TYPE 
LANGUAGE : 
ED 
AB 



0036-8075 

for the Advancement of Science 



IT 



303 (5658) , 689-692 
CODEN: SCIEAS; ISSN: 
American Association 
Journal 
English 
Entered STN: 28 Jan 2004 

Predatory bacteria remain molecularly enigmatic, despite their presence in 
many microbial communities. The complete genome of Bdellovibrio 
bacteriovorus HD100, a predatory Gram-neg. bacterium that invades and 
consumes other Gram-neg. bacteria, is now reported. Its surprisingly 
large genome shows no evidence of recent gene transfer from its prey. A 
plethora of paralogous gene families coding for enzymes, such as 
hydrolases and transporters, are used throughout the life cycle of B. 
bacteriovorus for prey entry, prey killing, and the uptake of complex 
mols. The complete sequence comprising 3,782,950 bp and 3584 predicted 
open reading frames is deposited in GenBank/EMBL/DDBJ under accession no. 
BX842601 and in the RefSeq database under accession no. NC_005363. 
626113-69-7 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
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(Biological study) 

(amino acid sequence; life cycle of Bdellovibrio bacteriovorus based on 
its complete genomic sequence) 
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1676(3), 240-250 



CAPLUS COPYRIGHT 2004 ACS on STN 
2004 : 146210 CAPLUS 

The mouse secreted gel-forming mucin gene cluster 
Escande, Fabienne; Porchet, Nicole; Bernigaud, Annie; 
Petitprez, Daniele; Aubert, Jean-Pierre; Buisine, 
Marie-Pierre 

Unite 560 INSERM, Lille, 59045, Fr . 
Biochimica et Biophysica Acta (2004; 
CODEN: BBACAQ; ISSN: 0006-3002 
Elsevier Science B.V. 
Journal 
English 
Entered STN: 23 Feb 2004 

Using genomic cosmid and BAC clones and genome shotgun supercontigs 
available in GenBank, we detd. the complete gene structure of the four 
mouse secreted gel-forming mucin genes Muc2, Muc5ac, Muc5b and Muc6 and 
the organization of the genomic locus harboring these genes. The mouse 
secreted gel-forming mucin gene is 215 kb on distal chromosome 7 to 69.0 
cM from the centromere and organized as: Muc6-Muc2-Muc5ac-Muc5b with Muc2, 
Muc5ac and Muc5b arranged in the same orientation and Muc6 in opposite. 
Mouse mucin genes have highly similar genomic organization to each other 
and to their resp. human homologues indicating that they have been well 
conserved through evolution. Deduced peptides showed striking sequence 
similarities in their N- and C-terminal regions whereas the 
threonine/serine/proline-rich central region is specific for each other 
and for species. Expression studies also showed that they have expression 
patterns similar to human mucin genes with Muc2 expressed in small and 
large intestines, Muc5ac and Muc6 in stomach, and Muc5b in 
laryngo-tracheal tract. These data constitute an important initial step 
for investigation of mucin gene regulation and mucin function through the 
use of animal models. 
INDEXING IN PROGRESS 
623719-21-1, GenBank CAD54412 
RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; mouse secreted gel-forming mucin gene cluster) 
REFERENCE COUNT: 34 THERE ARE 34 CITED REFERENCES AVAILABLE FOR THIS 
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Complete genome sequence of the metabolically 
versatile photosynthetic bacterium Rhodopseudomonas 
palustris 

Larimer, Frank W. ; Chain, Patrick; Hauser, Loren; 
Lamerdin, Jane; Malfatti, Stephanie; Do, Long; Land, 
Miriam L. ; Pelletier, Dale A.; Beatty, J. Thomas; 
Lang, Andrew S.; Tabita, F. Robert; Gibson, Janet L. 
Hanson, Thomas E.; Bobst, Cedric; Torres y Torres, 
Janelle L.; Peres, Caroline; Harrison, Faith H.; 
Gibson, Jane; Harwood, Caroline S. 
Genome Analysis and Systems Modeling, Oak Ridge 
National Laboratory, Oak Ridge, TN, 37831, USA 
Nature Biotechnology (2004), 22(1), 55-61 
CODEN: NABIF9; ISSN: 1087-0156 
Nature Publishing Group 
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IT 



Journal 
English 
Entered STN: 05 Jan 2004 

Rhodopseudomonas palustris is among the most metabolically versatile 
bacteria known. It uses light, inorg. compds . , or org. compds., for 
energy. It acquires carbon from many types of green plant-derived compds. 
or by carbon dioxide fixation, and it fixes nitrogen. This report 
describes the genome sequence of R. palustris, which consists of a 
5,459,213-bp circular chromosome with 4836 predicted genes and a plasmid 
of 8427 bp. The sequence reveals genes that confer a remarkably large no. 
of options within a given type of metab., including three nitrogenases, 
five benzene ring cleavage pathways, and four light harvesting 2 systems. 
R. palustris encodes 63 signal transduction histidine kinases and 79 
response regulator receiver domains. Almost 15% of the genome is devoted 
to transport. This genome sequence is a starting point to use R. 
palustris as a model to explore how organisms integrate metabolic modules 
in response to environmental perturbations. The genome sequence of R. 
palustris CGA009 ia available in GenBank/EMBL/DDBJ under accession nos . 
BX571963 (chromosome) and BX571964 (plasmid) and in the RefSeq database 
under accession no. NC__005296. 
626155-11-1 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 

(Biological study) 

(amino acid sequence; complete genome sequence of the metabolically 
versatile photosynthetic bacterium Rhodopseudomonas palustris) 
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The complete genome and protein sequence of the 
hyperthermophile Methanopyrus kandleri AVI 9 and 
monophyly of Archael methanogens, and methods of their 
use 

Slesarev, Alexei I.; Pavlov, Andrey; Pavlova, 
Nadezhda; Kozyavkin, Sergei 

Fidelity Systems, Inc., USA; Malykh, Andrei 

PCT Int. Appl., 1023 pp. 
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AB The complete 1,694,969 nucleotide sequence of the GC-rich genome of 

Methanopyrus kancileri was detd. using a novel approach. It is based on 
unlinking genomic DNA with the ThermoFidelase version of M. kandleri 
topoisomerase V and cycle sequencing directed by 2 1 -modified 
oligonucleotides (FimersK. Sequencing redundancy of 3.3-fold was 
sufficient to assemble the genome with <1 error per 40 kb. Using a 
combination of sequence database searches and coding potential prediction, 
1692 protein-coding genes and 39 genes for structural RNAs were 
identified. M. kandleri proteins show an unusually high content of neg. 
charged amino acids, which might be an adaptation to its high 
intracellular salinity. Previous phylogenetic anal, of 16S RNA suggested 
that M. kandleri belonged to a very deep branch, close to the root of the 
archaeal tree. However, genome comparisons, using both trees constructed 
from concatenated alignments of ribosomal proteins and trees based on gene 
content, indicate that M. kandleri consistently groups with other archaeal 
methanogens. M. kandleri shares the set of genes implicated in 
methanogenesis and, in part, its operon organization with Methanococcus 
jannaschii and Methanothermobacter thermoautotrophicus . These findings 
indicate that archaeal methanogens are monophyletic . A distinctive 
feature of M. kandleri is the paucity of proteins involved in signaling 
and regulation of gene expression. Also, M. kandleri appears to have 
fewer genes acquired via lateral transfer than other archaea. These 
features might reflect the extreme habitat of this organism. 

I T 602352-27-2 

RL: BSU (Biological study, unclassified); BUU (Biological use, 
unclassified); PRP (Properties); BIOL (Biological study); USES (Uses) 
(amino acid sequence; complete genome and protein sequence of 
hyperthermophile Methanopyrus kandleri AV19 and monophyly of Archael 
methanogens and methods of their use) 
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2004:11576 CAPLUS 
140:158554 

Expression of microbial proteins in plants for 

production of plants with improved properties 

Cao, Yongwei; Hinkle, Gregory J.; Slater, Steven C. 

Chen, Xianfeng; Goldman, Barry S. 

USA 

U.S. Pat. Appl. 
CODEN: USXXCO 
Patent 
English 
12 



Publ., 122 pp. 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



US 2003-369493 20030220 
US 2002-360039P P 20020221 



US 2003233675 Al 20031218 

PRIORITY APPLN. INFO. : 
ED Entered STN: 08 Jan 2004 

AB Recombinant constructs and methods useful for improvement of plants are 
provided. In particular, recombinant constructs comprising promoters 
functional in plant cells positioned for expression of polynucleotides 
encoding polypeptides from microbial sources are provided. The disclosed 
constructs and methods find use in prodn. of transgenic plants to provide 
plants, particularly crop plants, having improved properties. [This 
abstr. record is one of twelve records for this document necessitated by 
the large no. of index entries required to fully index the document and 
publication system constraints.]. 

IT 654766-87-7P 

RL: BMF ( Bioindustrial manufacture); BPN (Biosynthetic preparation); BUU 
(Biological use, unclassified); PRP (Properties); BIOL (Biological study); 
PREP (Preparation); USES (Uses) 



Searched by Barb O'Bryen, STIC 571-272-2518 



Robinson 09/632722 



Page 21 



(amino acid sequence; expression of microbial proteins in plants for 
prodn. of plants with improved properties) 

L5 ANSWER 15 OF 33 CAPLUS COPYRIGHT 2004 ACS on STN 

2003:629098 CAPLUS 
139:145044 

Nucleic acid and amino acid sequences relating to 
Proteus mirabilis for diagnostics and therapeutics 
Breton, Gary L. 

Genome Therapeutics Corporation, USA 
U.S., 870 pp. 
CODEN: USXXAM 
Patent 
English 
2 



ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 

INVENTOR (S) : 
PATENT ASSIGNEE (S) : 
SOURCE: 

DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



US 6605709 
US 6605709 
PRIORITY APPLN . 



Bl 
Bl 



20030812 
20030812 



INFO. 



US 
US 



US 2000-543681 
US 2000-543681 

1999- 128706P P 

2000- 543681 A 



20000405 
20000405 
19990409 
20000405 



ED Entered STN: 15 Aug 2003 

AB The invention provides isolated 4172 polypeptide and 4172 genomic DNA 
sequences derived from Proteus mirabilis strain 16525 (ATCC 202157). 
Chromosomal DNA was isolated after Zymolyase digestion and 
hydrodynamically sheared and the fragments of the genomic library 
sequenced and assembled. To identify P. mirabilis polypeptides, the 
compete genomic sequence of P. mirabilis was analyzed to identify all 
possible stop-to-stop open reading frames greater than 180 nucleotides in 
all 6 reading frames, the identified ORFs analyzed for homol. to known 
sequences, and the coding potential of non-homologous sequences evaluated 
with the program GENEMARK . RTM . . These sequences are useful in diagnosis 
and therapy of pathol. conditions, antibodies against the polypeptides, 
and methods for the prodn. of the polypeptides. The invention also 
provides methods for the detection, prevention and treatment of pathol. 
conditions resulting from bacterial infection. [This abstr. record is one 
of two records for this document necessitated by the large no. of index 
entries required to fully index the document and publication system 
constraints . ] . 

IT 570464-44-7 

RL: BSU (Biological study, unclassified); DGN (Diagnostic use); PRP 
(Properties); THU (Therapeutic use) ; BIOL (Biological study); USES (Uses) 

(amino acid sequence; nucleic acid and amino acid sequences relating to 

Proteus mirabilis for diagnostics and therapeutics) 



L5 ANSWER 16 OF 
ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 

AUTHOR (S) : 



CORPORATE SOURCE: 
SOURCE: 



PUBLISHER: 



33 CAPLUS COPYRIGHT 2004 ACS on STN 
2002:290302 CAPLUS 
136:289776 

The complete genome of hyperthermophile Methanopyrus 
kandleri AV19 and monophyly of archaeal methanogens 
Slesarev, Alexei I.; Mezhevaya, Katja V.; Makarova, 
Kira S.; Polushin, Nikolai- N.; Shcherbinina, Olga V. ; 

Belova, Galina I.; Aravind, L.; 
Rogozin, Igor B.; Tatusov, Roman 
Stetter, Karl 0.; Malykh, Andrei 
G.; Koonin, Eugene V.; Kozyavkin, Sergei A. 
Fidelity Systems, Gaithersburg, MD, 20879, USA 
Proceedings of the National Academy of Sciences of the 
United States of America (2002), 99(7), 4644-4649 
CODEN: PNASA6; ISSN: 0027-8424 
National Academy of Sciences 



Shakhova, Vera V.; 
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L.; Wolf, Yuri re- 
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DOCUMENT TYPE: Journal 
LANGUAGE: English 
ED Entered STN: 18 Apr 2002 

AB The complete 1 , 694 , 969-nucleotide sequence of the GC-rich genome of 

Methanopyrus kandleri was^detd. by using a whole direct genome sequencing 
approach. This approach is based on unlinking of genomic DNA with the 
ThermoFidelase version of M. kandleri topoisomerase V and cycle sequencing 
directed by 2 '-modified oligonucleotides (Fimers) . Sequencing redundancy 
(3.3-fold) was sufficient to assemble the genome with less than one error 
per 4 0 kb. Using a combination of sequence database searches and coding 
potential prediction, 1692 protein-coding genes and 39 genes for 
structural RNAs were identified. M . kandleri proteins show an unusually 
high content of neg. charged amino acids, which might be an adaptation to 
the high intracellular salinity. Previous phylogenetic anal, of 16S RNA 
suggested that M. kandleri belonged to a very deep branch, close to the 
root of the archaeal tree. However, genome comparisons indicate that, in 
both trees constructed using concatenated alignments of ribosomal proteins 
and trees based on gene content, M. kandleri consistently groups with 
other archaeal methanogens . M. kandleri shares the set of genes 
implicated in methanogenesis and, in part, its operon organization with 
Methanococcus jannaschii and Methanothermobacter thermoautotrophicum. 
These findings indicate that archaeal methanogens are monophyletic . A 
distinctive feature of M. kandleri is the paucity of proteins involved in 
signaling and regulation of gene expression. Also, M. kandleri appears to 
have fewer genes acquired via lateral transfer than other archaea. These 
features might reflect the extreme habitat of this organism. The sequence 
is deposited in GenBank under Accession No. AE009439. 

IT 406671-25-8 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; complete genome sequence of Methanopyrus kandleri 
AV19 and monophyly of archaeal methanogens) 
REFERENCE COUNT: 56 THERE ARE 56 CITED REFERENCES AVAILABLE FOR THIS 

RECORD. ALL CITATIONS AVAILABLE IN THE RE FORMAT 
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2001:385842 CAPLUS 
135:1234 

Eicosapentaenoic acid biosynthetic production by 

recombinant marine cyanobacteria, Synechococcus 

Yazawa, Kazuyoshi; Yu, Reiko; Yamada, Akiko; 

Matsunaga, Sunao; Takeyama, Haruko; Kurane, Ryuichiro 

Sagami Chemical Research Center, Japan; Bioindustry 

Association; Ministry of Economy, Trade and Industry; 
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Patent 
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PATENT NO. 
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APPLICATION NO. DATE 
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20010529 



JP 1999-329169 
JP 1999-329169 
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19991119 



JP 2001145490 
PRIORITY APPLN. INFO. 
ED Entered STN: 29 May 2001 

AB Transformation of marine cyanobacteria with eicosapentaenoic acid (EPA) 
synthetic enzyme gene cluster for EPA biosynthetic prodn., is disclosed. 
Plasmid vectors contg. the gene cluster are claimed. The eicosapentaenoic 
acid (EPA) synthesis gene cluster from an EPA-producing bacterium, 
Shewanella sp. SCRC-2738, was cloned into a broad-host range vector, 
pJRD215,. and then introduced into a marine cyanobacterium, Synechococcus 
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sp. NKBG15041c, by conjugation. The transconjugant cyanobacteria produced 
3.7 .+-. 0.2% (2.24 .+-. 0.13 mg/L) EPA (n-3) and 2.5 .+-. 0.2% (1.49 . +- . 
0.06 mg/L) eicosatetraenoic acid (n-3) of the total fatty acids when the 
cells were cultured at 23. degree. C at a light intensity of 1,000-1,500 Lx. 
The EPA and eicosatetraeridic acid contents of the cells were increased to 
4.6 .+-. 0.6% (3.86 .+-. 1.11 mg/L) and 4.7 0.3% (3.86 . +- . 0.82 

mg/L) , and 7.5 .+-. 0.3% (1.76 .+-. 0.10 mg/L) and 5.1 .+-. 0.2% t (1.19 

o.06 mg/L) when they were cultured at low temp. ( 18 . degree . C) and at 
lower light Intensity (40 Lx) , resp. 
IT 342056-52-4P 

RL: BPN (Biosynthetic preparation) ; BUU (Biological use, unclassified) ; 
PRP (Properties); BIOL (Biological study); PREP (Preparation); USES (Uses) 

(amino acid sequence; eicosapentaenoic acid prodn. by recombinant 

Synechococcus ) 
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DOCUMENT NUMBER: 135:87850 

TITLE : The complete genome of the crenarchaeon Sulfolobus 

solfataricus P2 

AUTHOR(S): She, Qunxin; Singh, Rama K.; Conf alonieri , Fabrice; 

Zivanovic, Yvan; Allard, Ghislaine; Awayez, Mariana 
J.; Chan-Weiher, Christina C.-Y.; Clausen, lb Groth; 
Curtis, Bruce A.; De Moors, Anick; Erauso, Gael; 
Fletcher, Cynthia; Gordon, Paul M. K. ; Heikamp-De 
Jong, Ineke; Jeffries, Alex C; Kozera, Catherine J.; 
Medina, Nadine; Peng, Xu; Thi-Ngoc, Hoa Phan; Redder, 
Peter; Schenk, Margaret E . ; Theriault, Cynthia; 
Tolstrup, Niels; Charlebois, Robert L.; Doolittle, W. 
Ford; Duguet, Michel; Gaasterland, Terry; Garrett, 
Roger A.; Ragan, Mark A.; Sensen, Christoph W.; Van 
der Oost, John 

CORPORATE SOURCE: Microbial Genome Group, Institute of Molecular 

Biology, University of Copenhagen, Copenhagen, 
DK-1307, Den. 

SOURCE* Proceedings of the National Academy of Sciences of the 

United States of America (2001), 98(14), 7835-7840 
CODEN: PNASA6; ISSN: 0027-8424 

PUBLISHER: National Academy of Sciences 

DOCUMENT TYPE: Journal 

LANGUAGE: English 

ED Entered STN: 22 Jul 2001 

AB The genome of the crenarchaeon Sulfolobus solfataricus P2 contains 

2,992,245 bp on a single chromosome and encodes 2977 proteins and many 
RNAs. One-third of the encoded proteins have no detectable homologs in 
other sequenced genomes. Moreover, 40% appear to be archaeal-specif ic, 
and only 12% and 2.3% are shared exclusively with bacteria and eukarya, 
resp. The genome shows a high level of plasticity with 200 diverse 
insertion sequence elements, many putative nonautonomous mobile elements, 
and evidence of integrase-mediated insertion events. There are also long 
clusters of regularly spaced tandem repeats. Different transfer systems 
are used for the uptake of inorg. and org. solutes, and a wealth of 
intracellular and extracellular proteases, sugar, and sulfur-metabolizing 
enzymes are encoded, as well as enzymes of the central metabolic pathways 
and motility proteins. The major metabolic electron carrier is not NADH 
as in bacteria and eukarya but probably ferredoxin. The essential 
components required for DNA replication, DNA repair and recombination, the 
cell cycle, transcriptional initiation and translation, but not DNA 
folding, show a strong eukaryal character with many archaeal-specif ic 
features. The results illustrate major differences between crenarchaea 
and euryarchaea, esp. for their DNA replication mechanism and cell cycle 
processes and their translational app. The complete annotated genome 
sequence is available in GenBank Accession No. AE006641. 
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IT 348670-68-8 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; complete genome of Sulfolobus solfataricus P2) 



REFERENCE COUNT: 



85 - THERE ARE 85 CITED REFERENCES AVAILABLE FOR THIS 
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Genome sequence and comparative analysis of the 
solvent-producing bacterium Clostridium acetobutylicum 
Nolling, Jork; Breton, Gary; Omelchenko, Marina V. ; 
Makarova, Kira S.; Zeng, Qiandong; Gibson, Rene; Lee, 
Hong Mei; Dubois, Joann; Qiu, Dayong; Hitti, Joseph; 
Wolf, Yuri I.; Tatusov, Roman L . ; Sabathe, Fabrice; 
Doucette-Stamm, Lynn; Soucaille, Philippe; Daly, 
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Smith, Douglas R.; Aldredge, Tyler; Ayers, Mark; 
Bashirzadeh, Romina; Bochner, Harry; Boivin, Mike; 
Bross, Susan; Bush, David; Butler, Carole; Caron, 
Anne; Caruso, Anthony; Cook, Robin; Daggett, Patricia; 
Deloughery, Craig; Egan, Jeff; Ellston, Dawna; 
Engelstein, Marcy; Ezedi, Johnny; Gilbert, Katie; 
Goyal, Anil; Guerin, Jennifer; Ho, Tay; Holtham, Kari; 
Joseph, Paul; Keagle, Pamela; Kozlovsky, Julia; 
LaPlante, Mary; LeBlanc, Gary; Lumm, Wendy; Majeski, 
Amy; McDougall, Steve; Mank, Philip; Mao, Jen-I.; 
Nocco, Diane; Patwell, Donivan; Phillips, Jonathon; 
Pothier, Bryan; Prabhakar, Shashi; Richterich, Peter; 
.Rice, Philip; Rosetti, Dawn; Rossetti, Mark; 
Rubenfield, Marc; Sachdeva, Meena; Snell, Philip; 
Spadafora, Rob; Spitzer, Lia; Shimer, George; Thomann, 
Hans-Ulrich; Vicaire, R. ; Wall, Kristen; Wang, Ying; 
Weinstock, Keith; Wong, Lai Peng; Wonsey, A.; Xu, 
Qinxue; Zhang, Liping 

GTC Sequencing Center, Genome Therapeutics 
Corporation, Waltham, MA, 02453, USA 
Journal of Bacteriology (2001), 183(16), 4823-4838 
CODEN: JOBAAY; ISSN: 0021-9193 
American Society for Microbiology 
Journal 
English 
Entered STN: 09 Aug 2001 

The genome sequence of the solvent-producing bacterium Clostridium 
acetobutylicum ATCC 824 was detd. by the shotgun approach. The genome 
consists of a 3.94-Mb chromosome and a 192-kb megaplasmid that contains 
the majority of genes responsible for solvent prodn. Comparison of C. 
acetobutylicum to Bacillus subtilis reveals significant local conservation 
of gene order, which has not been seen in comparisons of other genomes 
with similar, or, in some cases closer, phylogenetic proximity. This 
conservation allows the prediction of many previously undetected operons 
in both bacteria. However, the C. acetobutylicum genome also contains a 
significant no. of predicted operons that are shared with distantly 
related bacteria and archaea but not with B. subtilis. Phylogenetic anal, 
is compatible with the dissemination of such operons by horizontal 
transfer. The enzymes of the solventogenesis pathway and of the 
cellulosome of C. acetobutylicum comprise a new set of metabolic 
capacities not previously represented in the collection of complete 
genomes. These enzymes show a complex pattern of evolutionary affinities, 
emphasizing the role of lateral gene exchange in the evolution of the 
unique metabolic profile of the bacterium. Many of the sporulation genes 
identified in B. subtilis are missing in C. acetobutylicum, suggesting 
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major differences in the sporulation process. Thus, comparative anal, 
reveals both significant conservation of the genome organization and 
pronounced differences in many systems that reflect unique adaptive 
strategies of the two gram-pos . bacteria. The sequence data is available 
in the GenBank database with Accession Nos. AE0014 38 and 
AE007513-AE007868. 
IT 353881-58-0 ; 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; genome sequence and comparative anal, of the 
solvent-producing bacterium Clostridium acetobutylicum) 
REFERENCE COUNT: 88 THERE ARE 88 CITED REFERENCES AVAILABLE FOR THIS 
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WO 2000-US956 
ED Entered STN: 21 Jul 2000 

AB The present invention relates to compns . and methods for prepg. 

polyunsatd. long-chain fatty acids in plants, plant parts and plant cells, 
such as leaves, roots, fruits and seeds. Nucleic acid sequences and 
constructs encoding polyketide synthase (PKS)-like genes required for the 
polyunsatd. long-chain fatty acid prodn., including the genes responsible 
for eicosapentenoic acid prodn. of Shewanella putrefaciens and novel genes 
assocd. with the prodn. of docosahexenoic acid in Vibrio marinus are used 
to generate transgenic plants, plant parts and cells which contain and 
express one or more transgenes encoding one or more of the PKS-like genes 
assocd. with such long chain polyunsatd. fatty acid prodn. PKS-like genes 
from Schizochytrium aggregatum are also provided. Expression of the 
PKS-like genes in the plant system permits the large scale prodn. of 
polyunsatd. long-chain fatty acids such as eicosapentenoic acid and 
docosahexenoic acid for modification of the fatty acid profile of plants, 
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plant parts and tissues. Manipulation of the fatty acid profiles allows 
for the prodn. of com. quantities of novel plant oils and products. 
IT 153926-89-7 * 

RL: BOC (Biological occurrence) / BSU (Biological study, unclassified) ; BUU 
(Biological use, unclassified); PRP (Properties); BIOL (Biological study); 
OCCU (Occurrence); USES (Uses) 

(amino acid sequence; Schizochytrium polyketide synthase genes and 
transgenic plants for polyunsatd. long chain fatty acid prodn.) 
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AB The present invention provides DNA mols. that constitute fragments of the 
genome and cDNAs from Zea mays mays (HYBRID SEED #35A19) and Arabidopsis 
thaliana (ecotype Wassilewski ) , and polypeptides encoded thereby. The DNA 
mols. are useful for specifying a gene product in cells, either as a 
promoter or as a protein coding sequence or as an UTR or as a 3 1 
termination sequence, and are also useful in controlling the behavior of a 
gene in the chromosome, in controlling the expression of a gene or as 
tools for genetic mapping, recognizing or isolating identical or related 
DNA fragments, or identification of a particular individual organism, or 
for clustering of a group of organisms with a common trait. Arabidopsis 
DNA is used in the present expt . , but the procedure is a general one. 
Protocols are provided for Southern hybridizations and transformation of 
carrot cells. [This abstr. record is one of 15 records supplemental to 
CA13316218528Q necessitated by the large no. of index entries required to 
fully index the document and publication system constraints.]. 
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AOCS Press 
Journal 
English 
Entered STN: 06 Nov 2000 

The eicosapentaenoic acid (EPA) synthesis gene cluster from an 
EPA-producing bacterium, Shewanella sp. SCRC-2738, was cloned into a 
broad-host range vector, pJRD215, and then introduced into a marine 
cyanobacterium, Synechococcus sp. NKBG15041c, by conjugation. The 
transconjugant cyanobacteria produced 3.7 .+-. 0.2% (2.24 . +-. 0.13 mg/L) 
EPA (n-3) and 2.5 .+-. 0.2% (1.49 .+-. 0.06 mg/L) eicosatetraenoic acid 
(n-3) of the total fatty acids when the cells were cultured at 23. degree. C 
at a light intensity of 1,000-1,500 Lx. The EPA and eicosatetraenoic acid 
contents of the cells were increased to 4.6 0.6% (3.86 . +-. 1.11 

mg/L) and 4.7 . +- . 0.3% (3.86 .+-. 0.82 mg/L), and 7.5 .+-. 0.3% (1.76 
.+-. 0.10 mg/L) and 5.1 .+-. 0.2% (1.19 .+-. 0.06 mg/L) when they were 
cultured at low temp. ( 18 . degree . C) and at lower light intensity (40 Lx) , 
resp . 
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RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; eicosapentaenoic acid prodn. by a recombinant 

Synechococcus sp.) 
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(EPA) , which process comprises isolation of genes coding for a group of 
EPA biosynthesis enzymes from Shewanella putrefaciens strain SCRC-2874 and 
expression of the enzymes in Escherichia coli. Upstream of open reading 
frame ORF2, downstream of ORF10, and ORF2 are not involved with the 
biosynthesis of EPA. ORF3, 6, 7, 8, and 9 are essential for the 
biosynthesis of EPA. EPA is useful as pharmaceuticals, agrochems . , foods, 
and feeds . 
I T 153926-89-7 - 

RL : BAC (Biological activity or effector, except adverse); BSU (Biological 
study, unclassified) ; BUU (Biological use, unclassified) ; PRP 
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(amino acid sequence; cloning of genes for 9 eicosapentaenoic acid 
synthesizing enzymes of Shewanella putrefaciens and expression of genes 
in Escherichia coli for prodn. of eicosapentaenoic acid) 
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Thomson, B. J.; Martin, M . E. D.; Efstathiou, S.; 
Craxton, M. ; Macaulay, H. A. 

Dept. Clinical Sci., London Sch. Hygiene and Tropical 
Med., London, WC1E 7HT, UK 
Virology (1995), 209(1), 29-51 
CODEN; VIRLAX; ISSN; 0042-6822 
Academic 
Journal 
English 
Entered STN: 24 May 1995 

The complete DNA sequence was detd. for strain U1102 of human 
herpesvirus-6, a CD4+ T-lymphotropic virus with disease assocns. in 
immunodef icient settings and a possible complicating factor in AIDS. The 
genome is 159,321 bp in size, has a base compn. of 43% G + C, and contains 
119 open reading frames. The overall structure is 143 kb bounded by 8 kb 
of direct repeats, DRL (left) and DRR (right), contg. 0.35 kb of terminal 
and junctional arrays of human telomere-like simple repeats. Since eight 
open reading frames are duplicated in the repeats, six span repetitive 
elements and three are spliced, the genome is considered to contain 102 
sep. genes likely to encode protein. The genes are arranged colinearly 
with those in the genome of the previously sequenced betaherpesvirus, 
human cytomegalovirus, and has a distinct arrangement of conserved genes 
relative to the sequenced gammaherpesviruses, herpesvirus saimiri and 
Epstein-Barr virus, and the alphaherpesviruses , equine herpesvirus-1, 
varicella-zoster virus, and herpes simplex virus. Comparisons of 
predicted amino acid sequences allowed the functions of many human 
herpesvirus-6 encoded proteins to be assigned and showed the closest 
relation in overall no. and similarity to human cytomegalovirus products, 
with approx. 67% homologous proteins as compared to the 21% identified in 
all herpesviruses. The features of the conserved genes and their relative 
order suggested a general scheme for divergence among these herpesvirus 
lineages. In addn. to the "core" conserved genes, the genome contains 
four distinct gene families which may be involved in immune evasion and 
persistence in immune cells: two have similarity to the "chemokine" 
chemotactic/proinf lammatory family of cytokines, one to their peptide 
G-protein-coupled receptors, and a fourth to the Ig superfamily. 
167975-38-4 

RL: PRP (Properties) / 

(amino acid sequence; DNA and encoded peptide sequences of human 
herpesvirus-6) 
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CORPORATE SOURCE: Johns Hopkins Oncol. Cent., Baltimore, MD, 21231, USA 

SOURCE: Journal of Virology (1994), 68(2), 597-610 

CODEN: JOVIAM; ISSN: . 0022-538X 
Journal 
English 
Entered STN: 16 Apr 1994 

Human herpesvirus 6 (HHV-6) is prevalent in the human population, with 
primary infection occurring early in life. Its predominant CD4 + 
T -lymphocyte tropism, its ability to activate human immunodeficiency virus 
type. 1 (HIV-1) gene expression in vitro, and its upregulation of CD4 
expression has led to speculation that HHV-6 may act as a pos . cof actor in 
the progression of HIV infection to AIDS in individuals infected with both 
viruses. Previous sequencing studies of restricted regions of the 
161.5-kbp genome of HHV-6 have demonstrated unequivocally that it is a 
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member of the betaherpesvirus subgroup and have indicated that the HHV-6 
genome is generally collinear with the unique long (UL) component of human 
cytomegalovirus (HCMV) . This report extended the sequencing studies by 
detg. the primary structure of 38.5-kbp of the HHV-6 genome (genomic 
position 21.0 to 59.6 kbp) . Within the sequenced region lie 31 open 
reading frames, 20 of which are homologous to positional counterparts in 
HCMV. Of particular significance is the identification of homologs of the 
HCMV UL36-38 and US22-type genes, which have been shown to encode 
transactivating proteins. DNA sequences encoding these HHV-6 homologs 
were able to transactivate HIV-1 long terminal repeat-directed 
chloramphenicol acetyltransf erase expression in cotransf ection assays, 
thus demonstrating functional as well as structural conservation of these 
betaherpesvirus-specif ic gene products. The data confirm the close 
relationship between HHV-6 and HCMV and possibly also in the interactions 
between HHV-6 and HIV in dually infected cells. 
153676-37-0, Genbank L25528-derived protein 
RL: PRP (Properties) 

(amino acid sequence of) 
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parts thereof having its specific immunological properties. It further 
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Human herpesvirus-6 glycoprotein H and L homologs are 
components of the gplOO complex and the gH external domain 
is the target for neutralizing monoclonal antibodies. 
Liu D X; Gompels U A; Foa-Tomasi L; Campadelli-Fiume G 
Department of Medicine, University of Cambridge, United 
Kingdom . 
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Previous studies have shown that monoclonal antibody (MAb) 2E4 neutralizes 
infectivity of human herpesvirus-6 (HHV-6) and also inhibits virus-induced 
T-lymphocyte syncytia formation. Here we characterize two additional MAbs, 1D3 
and 5E7, which have similar properties, and identify the glycoprotein targets. 
The MAbs could immunoprecipitate and immunofluorescence glycoprotein from both 
A and B variant strain groups of HHV-6. In reactions with infected cells the 
MAbs immunoprecipitated a complex of glycoproteins, the "gplOO" complex, 
composed of a major glycoprotein species of 100,000 M(r) and minor components 
of 80,000 M(r) and 32,000 M(r). We show that the 100,000 M(r) product and most 
likely the 80,000 M(r) correspond to the HHV-6 homologue of herpes simplex 
virus-1 (HSV-1) glycoprotein H (gH) while the 32,000 M{r) species corresponds 
to the glycoprotein L (gL) equivalent. All three MAbs could specifically 
immunoprecipitate either gH expressed on its own in fibroblasts or a complex of 
gH and gL co-expressed, but could not immunoprecipitate gL expressed on its 
own. Consistent with these results, the MAbs could recognize gH in an 
immunofluorescence assay but not gL. Therefore although the MAbs recognized 
the complex of glycoproteins, they appeared specific for gH. The HHV-6 
glycoproteins were produced in a transient expression system induced by 
T7-vaccinia virus. Immunoprecipitations were carried out in comparisons with 
an "epitope-tagged" gH, a recombinant glycoprotein designed to contain at the 
N-terminus the linear epitope for MAb LP14, raised originally against HSV-1 
glycoprotein gD. The epitope-tagged gH was also used as a positive control in 
determining the domain of HHV-6 gH to which MAbs 2E4, 1D3 and 5E7 were 
directed.- Two gH deletions were constructed, one deleting sequences which may 
serve as a transmembrane and cytoplasmic anchor domains, the second deleting 
also part of the external domain. MAb LP14 could immunoprecipitate both HHV-6 
gH deletions but the gplOO MAbs recognized only the full-length product or the 
intact external domain minus the transmembrane and cytoplasmic domains. This 
indicated the epitopes for these MAbs are contained in the external domain of 
gH, consistent with the MAbs action in neutralization of virion infectivity and 
inhibition of virus to ce-11 spread by T-lymphocyte fusion. 
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S-layer protein from Thermus thermophilics HB8 assembles 
into porin-like structures. 

Caston J R; Berenguer J; de Pedro M A; Carrascosa J L 
Centro de Biologia Molecular (CSIC-UAM) , Universidad 
Autonoma de Madrid, Cantoblanco Spain. 
Molecular microbiology, (1993 Jul) 9 (1) 65-75. 
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ABSTRACT: 

The cells of the extreme thermophile Thermus thermophilus are surrounded by a 
regular layer (S-layer) built up by a protein with an apparent molecular mass 
of 100 kDa (PlOO) . . From purified membrane fractions, three different class of 
two-dimensional crystals can be obtained by following alternative extractive 
procedures. One of these crystals, with p6 symmetry, clearly represents the 
native S-layer detected by freeze etching on whole cells, while the other two, 
showing p2 and p3 symmetries respectively, closely resemble aggregates of 
bacterial porins . We demonstrate here by limited proteolysis and Western 
blotting the surprising fact that the protein component of the three crystals 
is the PlOO protein. Our biochemical data also show how this protein forms 
Ca (2+) -stabilized trimers in each crystal, which support the structural 
analysis that points to p3 units as the common structural block in all of them, 
and again resembles the situation found in bacterial porins. 
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AAA46012 
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Human herpesvirus 6 
Human herpesvirus 6 

Viruses; dsDNA viruses, no RNA stage; Herpesviridae; 
Betaherpesvirinae; Roseolovirus . 
1 (residues 1 to 870) 

Neipel,F., Ellinger,K. and Fleckenstein, B . 

Gene for the major antigenic structural protein (plOO) of human 
herpesvirus 6 

J. Virol. 66 (6), 3918-3924 (1992) 

92260671 

1374813 

Method: conceptual translation. 

Location/Qualifiers 
1..870 

/organism="Human herpesvirus 6" 
/db_xref="taxon: 10368" 
1. .870 

/name="pl00" 
1. .870 

/gene-"pl00" 

/coded_by-"M87287. 1:639. .3251" 



ORIGIN 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 



// 



mdlqrhpipf 
ynllvlwlmy 
sqf sdinral 
dpwdengyli 
vyshrvlqtp 
gdihsnltlp 
lkeatlggen 
igpsgildfn 
ppwsrntaan 
kdlrgkdgf r 
andf f aglnk 
gdarnagdeq 
eseeanveki 
pgngereidl 
dgetdentvs 



awldrdkver 
hyvlskrkpd 
lrlgnf ikwg 
tsinkliylg 
ltgqrvesf 1 
smsidtkkss 
vkattpassv 
vkfppnaqln 
adf ldpvhrf 
kqkqsdipks 
kndrdvlrgg 
yiqaglgqrv 
levsnpqdmf 
alekvkvget 
gpgvaesldi 



ltdf lsnler 
ynaiwqditk 
snvaidtpyv 
klf laltqsw 
kiitsdydii 
ldparlkksn 
slmsgvesps 
tngvdllqdk 
vpeqtgtpf v 
ltkerndkai 
kgnskdlhsg 
nnllsqf tnl 
knf rlqndld 
etsdlkvgqd 
eakgesaias 



ldnvdlrehp 
lqsvvneyln 
nltaedssei 
sklekvamsq 
kssleshsas 
srsldsf lrm 
sf tstnldlp 
tsigspssgi 
lnnsdvagse 
mhsrevtgds 
gnakkkemsg 
islgekgied 
svqspf rlpd 
esf vpaqlmk 



hvtnscvvre 
skglnkgife 
ennlqdaekn 
ivitqnhlsg 
kaf smsei gp 
qrqpkf leld 
lssf tstnld 
tdvvngf anl 
akhttystet 
gdatetvgar 
kfnddkemtr 
ilqnqrgtel 
adlsreldsa 
vetpeekddi 



gddvddlktl 
nmf tnkekf e 
mlwytvynin 
hlrrhdnf ni 
nslmdfvplr 
svdnagekil 
lrdkshgnyk 
nlhqnksnvs 
gvsprnvf li 
nspalrkikq 
ngqepsrslm 
klatenksgr 
sfkdaldlkl 
ieqmvlrirq 
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□ 1: BAB17787 . mucin [Rattus nor...[gi: 11 138240] BLink, Domains, Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



Protein 



CDS 



BAB17787 1851 aa linear ROD ll-NOV-2000 

mucin [Rattus norvegicus] . 

BAB17787 

BAB17787.1 GI:11138240 
accession AB042530 . 1 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 (sites) 

Oinuma,T. and Suganuma,T. 

Rat gastric mucin MucSAC: Sequence of its 5 1 -region contains 
conserved D-domains and two leucine zipper motifs 
Unpublished 

2 (residues 1 to 1851) 
Oinuma,T., Oinuma,T. and Suganuma,T. 
Direct Submission 

Submitted ( 09-MAY-2000 ) Tsutomu Oinuma, Miyazaki Medical College, 
Department of Anatomy; Kihara 5200, Kiyotake, Miyazaki 8891692, 
Japan (E-mail : tu@gray . miyazaki-med. ac.jp, Tel : 81985851784, 
Fax:81985858406) 

Location /Qualifiers 

1..1851 

/organism="Rattus norvegicus" 
/strain="Wistar" 
/db_xref="taxon: 10116" 
/ sex="male" 

/tissue_type=" stomach" 
1. .1851 

/product = "mucin" 
1..1851 

/gene-"Muc5AC" 

/coded_by="AB042530 .1:28.. >5580" 



ORIGIN 



ehysdlsqpq ghvgtplnrv 
yfpglcnyvf sehcgaayed 



1 mlhsmgvgrr klapfwvlal altfnqhtgq aledtrkshl 

61 tiipplktip vvrafnpaht rrvcstwgnf hyktfdgqvf 

121 fniqlrrgle snsttlsrvi mkldglvvel tkssvlvnnh pvqlpfsqsg vlielsngyl 

181 kvvarlglvf mwndddslll eldtkyankt cglcgdfngs pesseflshn vrltplefgn 

241 fqkmdgpteq cqdplpvpqk ncsirssice eilkgqlfsn caalvdissy leacqqdlcl 

301 cessdpsnci chtlaeysrq cahaggqpqn wrgpnlcpqt cllnmeyqec gspcvdtcsn 

361 pqhsqvcedh cvagcfcpeg mvlddsnqtg cvpvsqcacl yngtlyapgt systdctkct 

421 csggqwscqe vpcsgtcsvm ggshistfde rqytvhgdcs yvlckpydsn aftvlaelrk 

481 cgltesetcl ktvtlnlggg ktvitvkatg evfvnqiytq lpvstanatf frpstffiig 

541 qtnlglqlei qlhpimqvsv riapefrglt sglcgnfnsm qaddfqtisg vvegtaaaff 

601 ntfktqaacp nvknifedpc slsvenekya qhwcsqltda ngpfsqchat vnpstffsnc 

661 mfdtcnceks edclcaalss yvracaakgv llsdwregic akptitcpks mtyqyhistc 

721 qptcrslsee dvtchvnfip vdgctcpkgt flddsgkcvq atscpcyykg spvpngesvh 

781 dngaictctq galtciggpv Itpvcdapmi yfdcrnatpg dtgagcqksc htldmtcyss 

841 ecvpgcvcpn glvadgngsc vvaedcpcvh neatyrpget iqvgcnnctc enrmwqctdk 

901 pclatcavyg dghyitfdgq rysfsgdcey tllqdncggn gssqdafrvv tenipcgttg 
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961 ttcskgikif lgsyelklsd 
1021 tsiflrlspe fkgkvcglcg 
1081 kdpctanpyr kswaqkqcsi 
1141 avaayaqach evgvcvswrt 
1201 qdlrglegcy pkcpptapif 
12 61 vctesgvnct hdagacvcty 
1321 stsppppttf sf sttlvmts 
1381 vlttstsast ltmpacqeec 
14 41 ecraednpgv pfhalqqhve 
1501 tqtthlivsr tstmedttss 
1561 ptpvssttvk ttlpttspmp 
1621 sypepgrssg dfdtfvnlra 
1681 clnknqlppi cynyeiriec 
1741 tsghstvint ktwvtgstht 
1801 rvstthttht sspptggtsp 



skmevvqkgv gqeppyfvhq 
nfddnaindf ttrsqsvvsd 
insaafsach ahvepakyye 
pdicplfcdy ynpegqcewh 
degtmqcvsn ctvspspcrv 
ngqryhpgdt iyhttdgmgg 
mqpssthssp tpsvvypgsp 
lwspwmdisr pgrgidsgdf 
csttvglicy nsdqvsglcd 
vpvtstehty stvasspsth 
eptsatssvs istlgstlas 
kgykfcekpw nvecraqffp 
ctivnicstt sattqptshg 
tpqpgtrptp sivstqdtst 
tstthtsspn tggtsptstt 



mgnylvvetd iglvllwdkk 
mlef gnswkl spscpdasvs 
acvndacacd sggdcecfct 
yqpcgapcmr tcqnptgqcl 
ngklyrpgtp ipsdencysc 
cisahcrdng tierivdtcs 
skavltassv ssvktpetts 
dtlenlhahg yqicpvpkav 
nyqikiqcct pincptstgp 
tpgpspsssv psssaparst 
peithgcrke lcnwtdwidg 
ntplqelgqd vtcsrevgli 
vsiktktnwi tntysfsten 
ssvqtdstts sshtsspntg 
htsspxtggt s 
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